Simplifying IKE process in a gateway to enable datapath scaling using a two tier cache configuration

ABSTRACT

Computational complexity, specifically, cryptographic operations, is removed from the IKE(Internet Key Exchange) process in a VPN gateway appliance, thereby enabling scaling of the number of datapaths that can be managed by a single IKE process. A two-tier cache configuration enables necessary cryptographic operations on packets in the gateway but does so without placing additional computational burdens on the IKE process. One cache containing security association data is local to the IPSec component of the datapath instance. The second cache is higher level and is populated by IKE with security association data upon completion of IKE Phase 2 negotiations. The local cache is searched first for security policy data and if found is used to encrypt/decrypt the data packet. If not found locally, the IKE centralized cache is searched and if found, the local cache is updated with the security association data.

CROSS-REFERENCE TO RELATED APPLICATIONS

This application claims priority under U.S.C. §119(e) to pending U.S. Provisional Application No. 61/925,145, filed Jan. 8, 2014, and is a Continuation-in-Part of U.S. patent application Ser. No. 14/459,976, filed on Aug. 14, 2014. U.S. patent application Ser. No. 14/459,976 claims priority to U.S. Provisional Application No. 61/866,320, filed on Aug. 15, 2013. All of these applications are hereby incorporated by reference in their entireties.

BACKGROUND OF THE INVENTION

1. Field of the Invention

The present invention relates to software, computer network communications, and VPN gateway components. More specifically, it relates to cryptographic functions applied to data packets in the Internet Key Exchange (IKE) process in a VPN gateway appliance.

2. Description of the Related Art

There is an increasing need for mobile security in enterprises that have users utilizing mobile devices and mobile apps for work or to access services behind the enterprise firewall. Although there is conventional software that provides for a VPN from a mobile device to a VPN gateway, the level of security is often insufficient and provides little information to the enterprise about enterprise-enabled app usage. In this conventional scenario, one VPN is used by all the data from all enterprise apps to the VPN gateway. In other words, the VPN is at the device level. Given the growing number of users executing work-related apps on their personal mobile devices and having to connect to work-related or enterprise-related services through a secure tunnel, having a single VPN pipe for all the data going to and from a mobile device to an enterprise VPN gateway does not provide the level of security needed to prevent hackers and other bad actors from stealing and manipulating the data or from planting malware on the mobile devices that could eventually harm enterprises.

A higher level of security would involve not having multiple apps (let alone an entire device) share a single VPN, in other words, not using a device-level secure VPN tunnel. A better level of security would be to have each app have its own dedicated VPN to a gateway, one that is secure and only transports data for that app or at least one that only transmits data for a federation of apps. In one scenario, each enterprise app on a personal mobile device has its own VPN connection to the enterprise gateway (operated by the user's employer, client, supplier, or the like). Moreover, the enterprise VPN gateway has to be able to manage and create a high volume of VPN connections (e.g., hundreds of thousands of VPNs), keeping in mind that each VPN connection may require a private, unique IP address. The enterprise VPN gateway will also have to be able to terminate this high number of VPN connections and manage traffic coming from and going to internal enterprise servers.

Conventionally, the IKE process in the gateway has to have knowledge of or understand datapaths and IPSec instances within the datapaths. It also needs to know which datapaths are responsible for tunneled flows (the respective security associations and security policies for a flow) in order for IPSec to determine which packets to encrypt, decrypt, and allow through the tunnel. This has caused increased processing and complexity within IKE and made it infeasible to scale the number of datapaths in each IKE process. It would be desirable to remove this complexity from IKE. It would be desirable to have a process where IKE does not need to have any inherent knowledge of datapaths, IPSec instances or their respective security policy and security association mappings and tunnels.

SUMMARY OF THE INVENTION

One aspect of the present invention is a method of performing cryptographic operations on data packets received by and transmitted from a VPN gateway having an Internet Key Exchange (IKE) process. The gateway receives a data packet from a wrapped app via a secure tunnel. The packet is routed to a datapath having an IPSec component in the IKE process. After being routed to a datapath, the gateway determines whether the packet is an IPSec packet. If it is, a security parameter index (SPI) is extracted from the packet. If not, the packet header is hashed to obtain the security parameter index. The SPI is used to search a cache that is local to the IPSec component. If the security association data needed to encrypt or decrypt the packet is found in the local cache, then the cryptographic operations on the packet is performed. If the security association (SA) data is not in the local cache, the gateway searches a second cache which contains security policy and security association data and is under control of the IKE process. This second cache is populated by IKE after it has completed IKE Phase 2 negotiations and may be characterized as the IKE centralized cache. Assuming the SA for the packet is found there, cryptographic operations on the packet are performed. If the SA is not found there, the packet is dropped. In this manner, a two-tier cache configuration enables necessary cryptographic operations on packets in the gateway but does so without placing additional computational burdens on the IKE process.

BRIEF DESCRIPTION OF THE DRAWINGS

References are made to the accompanying drawings, which form a part of the description and in which are shown, by way of illustration, specific embodiments of the present invention:

FIG. 1A is a block diagram showing an overview of the app control process of the present invention;

FIG. 1B is a block diagram showing an alternative embodiment of an app control process of the present invention;

FIG. 2 is a block diagram showing components of an app security program in accordance with one embodiment of the present invention;

FIG. 3 is a flow diagram showing a process of making an app secure before downloading it on to a device in accordance with one embodiment of the present invention;

FIG. 4 is a flow diagram of a method performed in policy manager in accordance with one embodiment;

FIG. 5 is a flow diagram showing a process of a security-wrapped app executing on a handset or mobile device in accordance with one embodiment;

FIG. 6 is a system architecture diagram of the app security control system in accordance with one embodiment;

FIG. 7 is a block diagram showing components and modules in a mobile device needed for implementing a per-app VPN in accordance with one embodiment;

FIG. 8 is a flow diagram of a process of implementing an app VPN in accordance with one embodiment of the present invention;

FIG. 9 is a block diagram showing components of a network illustrating various concepts of the present invention;

FIG. 10 is a flow diagram of a process executed by the gateway device to determine how to route data back to a specific app from an enterprise server in accordance with one embodiment;

FIG. 11 is a screen shot of a display showing a VPN gateway active session monitor screen in accordance with one embodiment;

FIG. 12 is a block diagram showing some of the relevant components of the present invention in the gateway component;

FIG. 13 is a flow diagram of a process of obtaining security association data for a packet without involving the IKE process in accordance with one embodiment;

FIG. 14 is a block diagram showing data and interactions between the SPD/SA cache and the IPSec local cache;

FIG. 15 is a block diagram showing in more detail what data is populated in the SPD/SA cache in accordance with one embodiment; and

FIGS. 16A and 16B are block diagrams of a computing system suitable for implementing various embodiments of the present invention.

DETAILED DESCRIPTION OF THE INVENTION

Example embodiments of an application security process and system are described. These examples and embodiments are provided solely to add context and aid in the understanding of the invention. Thus, it will be apparent to one skilled in the art that the present invention may be practiced without some or all of the specific details described herein. In other instances, well-known concepts have not been described in detail in order to avoid unnecessarily obscuring the present invention. Other applications and examples are possible, such that the following examples, illustrations, and contexts should not be taken as definitive or limiting either in scope or setting. Although these embodiments are described in sufficient detail to enable one skilled in the art to practice the invention, these examples, illustrations, and contexts are not limiting, and other embodiments may be used and changes may be made without departing from the spirit and scope of the invention.

Methods and system for preventing device software applications from infecting or otherwise damaging a device, in particular, a mobile device, are described in the various figures. These types of applications, used often on a variety of mobile devices, such as smart phones, tablet computers, gaming devices, and portable computing devices are commonly referred to as “apps.” These apps may also be downloaded on to non-mobile devices, such as TVs, computers, automobiles, and other emerging smart device categories. Methods and systems described are not intended to be limited to operation on mobile devices. These device programs or apps have proliferated and are now very prevalent. Currently, apps are typically written in either Java or C. The methods and systems described herein may be applied to apps written in either or to apps written in other languages for different platforms. Most apps, if not all, have to communicate with the mobile device's operating system to get a specific service that the app needs in order to perform its intended function and this service is usually only available from the operating system. A common example of such a service used is GPS to get the location of the device which the app may need. However, because of this exposure, apps are a vulnerability for the device and pose a security and privacy risk for the user. Companies want to be able enforce a centralized policy to control and secure access to its data and software. This is also true for end users (i.e., individuals, home users, and the like). It enables enterprise IT departments to maintain governance of corporate data. The methods described below provide a centralized way to control security with respect to apps that are downloaded onto mobile devices, where the devices are either an employee's personal phone or an employer's phone, so that those apps do not pose a security threat. Various embodiments of the invention may also be used by parents and individuals (i.e., in home or non-work environments) to ensure that their personal mobile devices are safe from malware and may also be used to apply controls, such as on usage. Embodiments of the app control software of the present invention may also be used for mobile device data protection and back-up and for application-level telemetry.

FIG. 1A is a block diagram showing an overview of the app control process of the present invention. It is a generic description of one process without being tied to a specific configuration or environment. An app 102 is provided by app provider 100 which can be any type of entity (individual, software developer, employer, etc.). It is generally unprotected and the only security surrounding it is provided by the operating system. The only shield and checking done on how it executes on the device once loaded is provided by the operating system.

The present invention enables additional security of the apps that is not provided by the device's operating system. A security application program 104 is applied to app 102. Or the app 102 is input to program 104, which may be supplied by a third-party app security provider. In one embodiment, security application program 104 has a policy manager and a policy wrapper which may be in different locations. They are described in greater detail in FIG. 2. Once security program 104 has been applied to app 102, the app is wrapped with a security layer so that the device is protected. It is shown as secured app 106. In one embodiment, secured app 106 is then downloaded onto a mobile device 108, such as a smart phone or tablet computer, where it executes securely without risking damage to device 108. Another benefit is that secured app 106 may also be managed by the company or other entity that is providing the app to the user, such as an employer providing the app to an employee. For example, if the user leaves the company, the company may automatically delete the app and any related data from the device. In another example, a parent may be able to limit the apps used by another person (e.g., a child) or to limit the amount of time, e.g., 10 minutes a day or limit which Web sites may be accessed by an app. Or, a parent is concerned that an app is leaking a child's location to unknown third parties. There may be numerous other examples. As noted, FIG. 1A is intended to show the general process of securing an app and downloading it onto a device. Note that in this embodiment, app 102 is not made secure from causing harm to the device after it is downloaded onto the device, but before. In another embodiment, the app is secured after it is downloaded onto the device, but before it can interact with the operating system.

FIG. 1B is a block diagram showing an alternative embodiment. An unsecured app 110 (also supplied by an app provider) is downloaded onto mobile device 112. In this embodiment, however, there may be a specially designed app on device 112 that blocks the actual installation of unsecured app 110. The special app (not shown) redirects unsecured app 110 to an app security program 114. The unsecured app 110 is wrapped in a security policy, the resulting app shown as secured app 116. It is then downloaded and allowed to be installed on device 112 by the special app. In this manner, an individual or home user, for example, who wants to protect her phone from security threats posed by apps, can have apps made secure (wrapped) by a third-party service or by her mobile phone carrier, to mention only two examples, before they are downloaded on to her phone. It should be noted that this security wrapping can be done to an app regardless of where the user downloads the app from. It may also be noted that in FIGS. 1A and 1B, the network and connections between the components and software are shown generically. The transmissions are primarily over the Internet (not shown) but may also be within a private network or both.

FIG. 2 is a block diagram showing components of an app security program in accordance with one embodiment of the present invention. In one embodiment, the security program has two major components, a policy manager and a policy wrapper. A policy manager 202 accepts input from an administrator or other individual who is responsible for setting security for the mobile device. The person may be referred to as the governor since he is governing the security of the one or more mobile devices. The security policy may be set using various user interface screens. There are numerous examples of policies, including geo-fencing (e.g., the app can only be used in a building) and others. The service provider or the entity providing the app security program may also provide default policy and security settings which may be useful for home users. Examples of policy settings are described below. Policy input 204 is inputted into policy manager 202. Policy manager 202 takes the input/settings from the governor and creates policies or meta-data 206. The format or form of meta-data 206 can vary. They essentially reflect the policy settings from the governor.

Metadata (policies) 206 may be used as input to a policy wrapper 208. In one embodiment, this component of the program takes the policies and uses them to secure an app 210 by wrapping it. Wrapper 208 receives an app 210 from a handheld device 212. In one embodiment, wrapper 208 receives a copy of an app 210 instead of the original app 214 that was downloaded onto phone 212 (see FIG. 1B above). Here the handheld device 212 user attempts to download an unsecured app 216 from an app provider 218. In the scenario in described in FIG. 1A, it may operate on the app itself instead of a copy. This may be the case where a market place or app store offers customers a secured version of the app along with an unsecured version (or only offer the secured version). A secured version 220 (security-wrapped version) is returned from policy wrapper 208 to device 212.

Metadata 206 may also be used to update a local policy file (an existing policy that is already on the device). A local policy file is used to update policy parameters residing on device 212. For example, in the case of “geofencing” (i.e., restricting use of an app to an certain physical areas) it is likely that the GPS locations controlled by the governor will change over time. When such a change occurs, the new policies can be applied in two different ways. One is to generate a new policy and apply it to the original app (i.e., wrap the app with the new policy). Another way is to allow dynamic configuration based on a local policy data file with the “variable” part of the policy encrypted/signed inside it. For example, an IT person may want the ability to override a configuration on a device directly through an IT app residing on the device for diagnostic purposes.

In one embodiment policies have two components: a fixed part and a variable part. The fixed part is the content described in the policy file (e.g., “protect the GPS at certain times of day”). The variable part typically is provided by the governor through a console (e.g. “what are the times when the GPS should be protected?”). The variable part can change without applying a new policy.

Policy designers can choose to forego the variable component of the policy and basically “embed” all data or content statically in the policy file. In this case, the console does not have any way to customize the policy.

If the policy designer chooses to include some variable component in the policy, when changes are made to the variable data (on the console), a new data file could be sent to the device to reflect the latest changes. Such a file would be encrypted/signed (to prevent a malicious app circumventing the policy), downloaded to the device, and used by the app security code on the device to apply the new data to the appropriate policy.

Such changes and updates may be done by local policy update component 222 at runtime. This component creates updated policy parameters on device 212. Thereafter, wrapped app 220 will use the updated policy parameters.

In one embodiment, policy manager 202 and policy wrapper 208 are components in the same app security program and may operate on the same computer. In other embodiments, the manager and wrapper components may be on separate computers. For example, the policy manager 202 may be on a server at one site and the policy wrapper 208 may be on a computer at another site and may be managed by a different entity or the same entity. Collectively the manager and wrapper form the app security program which, in one embodiment, is operated by a security service provider. It may also be provided by an enterprise, such as a company, employer, business partner, and the like, or by a mobile phone carrier.

FIG. 3 is a flow diagram showing a process of making an app secure before downloading it on to a device in accordance with one embodiment of the present invention. At step 302 a copy or clone of the app that is to be secured is made on the device. In one embodiment, this may be done on the mobile device itself or may be done off the device, for example, on components on the Internet, in the cloud, on an enterprise's server or on a carrier server. The user may be an individual, an employee of a company or other entity. As is known in the field, an app may be obtained in a number of ways, most typically from an app store or an app market, or directly from the app developer or provider or in any suitable manner. By making a copy, the original app is preserved giving the user an option to use either the secured or unsecured version and also protects the user's ability to use the app if something goes wrong with the app control process. Note that in one embodiment, the app is not yet downloaded on to the phone. In one embodiment, the methods described below are performed on separate computing devices. In another embodiment, the process may be performed on a mobile device, but the app is only executed on the device after the process is complete and the app has been made secure.

At step 304 the app is decapsulated. Most, if not all, apps have digital signatures signed by the author/developer. At step 304, as part of the decapsulation, the digital signature is removed from the app. This may be done using techniques known in the art. Decrypting the app may also be performed at this step. These and other steps provide the core object code of the app which may now be operated on by the app control program. The nature and specifics of this operation may depend on the mobile device's operating system.

There are several examples of operating systems for smart phones such as iOS (for the iPhone), Android (used on handsets from various manufacturers), Windows Mobile 7, Web 0/S, Palm, and others. At step 306, the core object code app may be either disassembled or decompiled to obtain the executable object code. For example, it can be either “native code” (CPU instructions) or bytecode (virtual machine instructions, such as Java or .Net). In one embodiment, this may be more of a modification process if the device runs iOS where the disassembly is closer to a process of locating and substituting certain links and terms. However, in general, the disassembly process to obtain the object code of an app after it has been decapsulated may be done using techniques known in the art, such as using disassemblers.

At step 308 the app object code is augmented with object code from the app security program. For example, this object code may include class files which are replaced with class files from the security program. The object code generally provides an interface to the mobile device operating system. The app control security program object code is derived, in part, from the policy/meta-data described above. In the case of iOS, the operation is different in that a ‘locate and substitute’ process occurs rather than an object code replacement. This takes into consideration an interrupt approach that iOS's uses. Generally, the app security program goes through the assembly language code. The specific items located are Software Interrupts (SWIs) within the object code and which are replaced with a branch to an app control security program layer which may then determine what further actions to take, such as making the request, enhancing the results, and others, as described below.

At step 310, after substitution of the object code (or substitutions of SWIs) has been made, the app security program prepares the security wrapped app for execution on the mobile device. The object code substituted into the app by the security program generally provides a bridge or connection between the app and the mobile device operating system. The security program class files may be described as wrapping around the operating system class files. The app security program class files are generated based on the policies created earlier (by input from the governor). The app is essentially re-wired for execution on the handset. It is re-wired to use the app security program layer in addition to the security provided by the mobile device operating system layer. That is, the secured app may still be subject to the security provisions of the operating system. In one embodiment, certain cosmetic changes may also be made to the app, such as changing the icon for the app to reflect that it is secured. By doing this, the user can be sure that when the app icon appears on the handset screen that the secured version of the app will be executed. The app has now essentially been re-factored or re-programmed by the security program.

At step 312 the app is signed with a new key, for example, with the key of the service provider or the key of the enterprise providing the secured app. The re-factored, secured version of the app is returned to the handset device. In another embodiment, the app is wrapped with the security layer on the phone. At step 314, in one embodiment, the original, unsecured copy of the app is deleted from the handset device. This may be done by the secured version of the app once it is downloaded onto the handset. In other embodiments, this is not done and both versions remain on the mobile device. At this stage the process is complete.

FIG. 4 is a flow diagram of a method performed in policy manager 202 in accordance with one embodiment. At step 402 the governor or other security policy individual is enabled to define, generate, and create security policies. This may be a network administrator for an enterprise deciding a vast array of mobile device security policies for hundreds of employees using dozens of enterprise apps (specifically for work) that may be downloaded on hundreds or thousands of mobile devices. On the other end of the spectrum, it may be a parent who is setting security policy for three or four apps downloaded by her child on a new mobile device. Other examples include preventing or squashing a gaming app using GPS, preventing an app from using a microphone on the device to record or eavesdrop on a conversation, among many others. In either case, the governor may take into consideration the category of the app, the type and nature of app, the author, the age-appropriateness, and numerous other factors. For example, has the same author written any other apps that may have been classified as malware or posed a security threat to the device. It may determine whether there are other apps by the same author. It is at this stage that the governor decides which rules to apply for each app. In one embodiment, this is done off-line by the governor. That is, it may be done using user interfaces on a home computer or on an enterprise network computer used by an administrator where security templates provided by the security program service provider (essentially default templates) may be used or very specific rules may be set using the templates.

At step 404 the security data input at step 402 is used by the app control security program to create the actual policies. At step 406 the app control security program object code is generated based on the input from the governor regarding security policies created at step 404. The governor or service provider may also update existing security policies if needed. As described above, the object code may be used to enhance certain original object code obtained from the disassembled app. The enhancement code is inserted to adjust security and privacy settings for an app in order to protect the enterprise and end user. The original app's behavior is altered which allows the governor to control how the app behaves. For example, if an app stores sensitive account information in the clear (i.e., un-encrypted), the behavior could be changed so that all information the app creates is stored in encrypted form and which can only be accessed by that app given that the key to the stored, persistent data would be unique to the app. In many instances the enhancement code can improve the apps performance since the code is optimized for a particular use scenario.

FIG. 5 is a flow diagram showing a process of a security-wrapped app executing on a handset or mobile device in accordance with one embodiment. At step 502 the behavior of the app when the app executes or immediately before it executes on the device is altered or modified. For example, behavior modification may include authentication during app initialization; e.g. smart/CAC card, or password challenge. Some apps, as originally designed, may not require a password for security, however, a secured version of an app which has been modified may require that the user enter a password. At step 504 the secured app executes on the mobile device by the user activating it (e.g., tapping on the icon if the device has a touch screen). Upon execution of the app, in one embodiment, control can take one of four options. As is known in the art, when an app executes, it makes calls or requests to the device operating system in order to carry out its functions. In many cases these calls may be harmless or pose no significant security threat to the phone or device. If this is the case, the call may be allowed to pass to the operating system as shown in step 506. Here the call is made to the device operating system and the app executes in a normal manner.

If the security layer or wrapper around the app detects that the app is making a request that may pose a security threat to the device, the app security layer may enhance or modify the request before it is passed to the operating system or other software or hardware component in the phone. This is shown at step 508. In one embodiment, the governor determines which calls are permissible by examining the one or more policies. For example, the governor may determine that all data should be saved in encrypted form. In another example, the governor may decide that only a select group of trusted apps should have data on a soldier's GPS coordinate. In one embodiment, there is no runtime logic to determine what is safe, a potential threat, or an actual threat; it is essentially pre-declared by the governor in the policy created at step 404 above. In another embodiment, there may be some runtime logic. For example, an app may be trying to send out expensive SMS text messages. The app control program may determine this and block the app from sending more than a certain number of text messages, for example, it may limit it to transmission of one message. The enhancement may be adding something new, such as a password requirement. In another example, if the call is to save data on the mobile device memory, the secured app may actually back up the data to a storage area in the cloud or on the Internet (i.e., off the device). In another example, the data related to the call may be encrypted.

At step 510 the secured app may determine that the call is an actual threat and should be dealt with in a more severe manner than at step 508. For example, it may have decided that based on the policy for an app, that if a camera on the device is accessed while in a secure building (e.g., the Pentagon), the app should immediately be terminated. Merely enhancing the request may not be sufficient in this case. At step 510, the request may not be allowed to proceed to the operating system or any other component of the device. However, in one embodiment, a response is returned to the app, but that response is intentionally not accurate or correct. It is essentially an obfuscated response. For example, it may be a GPS coordinate that is not the actual physical coordinate of the device (e.g., the device is in California, but the GPS coordinate that is returned to the app is a coordinate in Nebraska). This may be desirable when apps are used by children. Other examples may be returning bad or garbled data results if an app that should only run within a restrictive environment (e.g., a secure office area) is determined to be running outside that environment (e.g., at home). In this example, the app may be partially crippled so that the app can only access unclassified data and wherein classified information is nullified. In another example, when a user is attempting to paste or copy sensitive data from a classified app to a non-classified app, the app control program may change the copy of the data that is being pasted to garbage or essentially make it meaningless. After either steps 506, 508, or 510 have completed, the security-wrapped app continues execution on the mobile device at step 514.

At step 512 the security layer around the app has determined that the call being made by the app or that the app execution behavior in general poses too high a security threat level to the mobile device. In this extreme case, the security layer decides to terminate execution of the app and/or delete the app. For example, the app may be using too many resources on the phone, such as bandwidth, or is making too many high-risk calls to the operating system thereby over-exposing the mobile device. In this case, the app can simply be deleted from the phone or the app may be terminated. The user may not be able to re-execute it or re-install it. For example, an employee may not install that app again on the company phone because it was exposing sensitive company data. Or it may be determined that an app is secretly collecting data on the phone or installing malware.

FIG. 6 is a system architecture diagram of the app security control system in accordance with one embodiment. A trigger manager component 602 handles two events, one for generating a new policy 604 and another for updating policy parameters 606. Such events can be triggered by various systems. For example, a console administrator or governor might apply a new policy to all devices (a manual operation). Or a network monitoring application, after detecting suspicious traffic originating from a device (or app), could push a new policy that would prevent a user/device/app from accessing network resources (an example of an automated operation). The various systems or entities that have the authority to change/update polices, do so through the trigger manager 602.

New policy output 604 is input to a policy definition file 608 which may be generated at runtime and may include various types of code and extensions, for example, specific to the app control service provider, or to the app/user/device the policy applies to. Policy definition file 608 is input to a policy compiler 610 which has two outputs. One output is a wrapper definition file 612. This file is input to an app wrapper component 614. App wrapper component 614 is responsible for generating secure app by injecting custom binary code (native or bytecode) into an app, downloaded directly, for example, from an app store. Or the app could be an app the user downloaded on to his device, and then uploaded to an “AppControl” server.

App wrapper component 614 may have three inputs: apps from one or more app stores 616, certificate key management data from identity management component 618, and hardened components 620. Key management data is used to tie the identities of the user, device, and the app, and ensure that any operation subject to policy control can be tied to a specific user/device/app. This also ensures that a wrapped application can only be run on a specific device to prevent a malicious app from circumventing policies and hardened components 620 (for example “Device security framework”). The output from app wrapper 614 is a wrapped app 622 which is downloaded or installed onto mobile device 624 via the device's controller 626. Device controller 626 responsibilities include: download app from the app wrapper; ensure that app running on the devices are appropriately wrapped apps (e.g., app wrapped for user1 should not be installed/run on device for user2); report list/version of installed applications to allow the management console to control policies for each device/user/application; and download policy parameters when appropriate. Wrapped app 622 resides on device 624 coupled with policy parameters 628.

Returning now to policy compiler 610, the other output is a runtime policy definition file 630. This file is input to a runtime policy compiler 632 which also accepts as input policy parameters 606 (specified by the management console, or other subsystems). Output from compiler 632 is a device runtime policy file 634. This file 634 is downloaded onto device 624 as shown as policy parameters 628, and is used to customize the policies applied to wrapped app 622.

Described below are various use cases and capabilities of the app control security program of the present invention. One use case involves the separation of work life and personal life on a mobile phone. There are apps for the user's personal use and apps that the user's employer (or a business partner of the employer) may have provided and the apps operate on the same phone, which is often the user's personal phone. The governor who determines security of the apps that need to be secured on the user's phone may block copy/paste operations between apps (such as e-mail apps). The governor may set policies for the work-related apps that perform selective wipes of apps and associated files. User location-based policies may also control where certain apps may execute. Examples of levels of protection because of malware are denying access to contacts, denying transmission of SMS without consent, and the like.

Another example of a use case is app control. Using the present invention, white and black listing of apps may be implemented, as well as full deletion of apps according to the policies set by a governor. An app may be ‘sandboxed’ to protect the other apps, software, and hardware of the device. Other capabilities may include identity-based control of apps or services and highly granular control over app behavior. Trojan identification is another use case that can be implemented with the app security program. For example, each app and content may be encrypted to prevent rogue apps from gaining access to and stealing confidential data on the phone. The security program may also be able to identify anomalous system call behavior of an app to identify malicious Trojan apps that act outside of their published intent.

Another use case is back-up and recovery of app data in which IT security administrators and governors have data revision control and can implement app and device content migration through back-up and restore operations. In another use case is network traffic monitoring. The app on the mobile device may be brought under the visibility of existing enterprise IDS/IPS/Web filtering infrastructure to allow for inspection and control of app communications. The app security program can also integrate with third-party DNS services, such as Symantec's DNS service to identify malware. All app communications may be encrypted, including communications at the mobile phone service provider. Other use cases include session continuity, consumer privacy (e.g., GPS obfuscation, implementing safe DNSs), and intercepting payment/transaction messages from the mobile device (i.e., operating in the middle of mobile commerce streams).

In one embodiment, the app security service is offered by a third-party service provider, for example, to make apps used by end-users or individuals (i.e., users not associated with an employer or enterprise). For example, a parent may want to obfuscate the GPS of a child's phone because the parent does not want a social network site, such as Facebook, to know where the child is, essentially disabling GPS. In another embodiment, an app store, operated by a wireless phone carrier (e.g., Verizon, AT&T) may offer a secured app for an extra charge or premium. A customer of the carrier can download the secured app from the marketplace or online store instead of the unsecured version by paying an extra amount. In another embodiment, an enterprise may have its own app store for its employees, partners, and the like, where users can only download secured versions of the apps (which may be referred to as “hard” apps). These apps may have many of the security features described above as defined by a governor (security administrator) at the enterprise, such as blocking copying and pasting e-mail or corporate data, killing an app from the user's phone if the user leaves the company, and so on. A mobile phone carrier's DNS can typically access any site, but the app security program can block a mobile device browser so that it can access only a safe DNS (e.g., Symantec's DNS) from where only safe Web sites may be accessed. In another embodiment, the app security program provider can work with the mobile device manufacturer to incorporate the app security program or functionality into the hardware and software operations of the device. In this embodiment, described below, a user can download an unsecured app and make is secured on the phone or device itself before executing and does not have to access a third-party service to have the app secured or ensure that the app is secured before being downloaded onto the device.

As can be seen from various embodiments described above, the security of the mobile device extends beyond the device itself and is applied directly to the apps that are downloaded onto the device. Companies and other entities are able to take advantage of apps more freely without having to worry about the security risks, such as data leakage or malware infection of the company's enterprise IT system. Companies can maintain governance of its corporate data.

In another aspect of the present invention, a VPN is created and utilized by individual apps on a device. That is, an app has a VPN tunnel to communicate with, for example, a corporate VPN gateway. This VPN tunnel is used only between the single app on the device and the VPN gateway. In one embodiment, each security wrapped app has its own IP stack or, more generally, VPN stack. Methods and systems for creating a VPN tunnel for a specific wrapped app on a device are described in FIGS. 7 and 8.

As noted above, conventionally, a VPN tunnel is built (on the client/device side) on top of the system UDP or TCP modules, which in turn communicate with the VPN gateway. In the described embodiment, the device may be a smartphone, tablet, or other mobile device. In other embodiments, the device may be a PC or a laptop. It may also be a wearable device, such as a watch, goggles, or other nomadic Internet-enabled computing device. In yet other embodiments, the device may be any Internet-enabled appliance or system. Examples include cars that have Internet access, household appliances (refrigerators, washers, etc.), or HVAC, home heating/AC systems, or security systems. As noted, the described embodiment uses mobile devices where users download apps. However, the present invention may also be used in other embodiments and contexts.

Generally, there is software on the device, such as a smartphone, tablet, PC, or other Internet-enabled device, that allows it to make a VPN connection to a gateway device. However, the described embodiment provides a more compartmentalized way of creating and utilizing a VPN tunnel on the device that is more secure. In one embodiment, each wrapped app has its own VPN tunnel. In another embodiment, some or all apps in a federation of apps on a device have the option of sharing one VPN tunnel. A secure form of IPC may be used to communicate between the processes in the case of a “federated” application. As described in more detail below, IPSec packets may be built for each wrapped app (each wrapped app operating in its own sandbox, that is, outside the device operating system). The IP packets are transmitted via a proxy or virtual data link layer (part of the app VPN stack) to a native UDP module in the operating system. From there it is transmitted to a VPN gateway. As noted above, the device may also be a PC. For example, a PC running a wrapped version of Microsoft Outlook may have its own VPN tunnel from the PC to the gateway

In order to establish a VPN tunnel, an initial step is to create only packet types that a typical operating system makes available to applications. For example, to connect to an IP-based VPN, an application can use TCP or UDP packets. In the described embodiment, the app IP stack uses UDP packets. Generally, it should not use “raw” IP packets (without a specific Layer 4 protocol), because such packets are typically reserved by operating systems so that only privileged applications may send or receive them. A per-app VPN, as described in various embodiments of the present invention, is not required to be a privileged process on the device. NAT traversal of IPsec packets is enabled using UDP protocol (as described in RFC 3947 and RFC 3984), rather than raw IP packets.

In order for an app to send data through a VPN tunnel to a gateway, the app must be able to build IP packets which are then encapsulated using VPN software on the device. The IP packets are built using an IP stack. This software typically resides in the operating system of the device. Therefore, in order for a per-app VPN to operate, in one embodiment, the app makes use of an IP or VPN stack that is used to build a tunnel that is only used between the single application and the gateway. That is, in one embodiment, it is not used by other apps. This stack may also be referred to as a per-app IP stack, in the app sandbox. The goal being to generally replicate operating system functionality in the sandbox.

In order for any IP stack to access an outside network, it uses software referred to as data link interface (also known as Layer 2, per the TCP/IP and OSI networking models). In one embodiment of the present invention, and as shown in FIG. 7, this data link interface is implemented as a proxy (or virtual) data link interface to an underlying operating system's (native) IP stack. In one embodiment, given that only UDP packets are sent and received, this proxy data link interface supports sending and receiving UDP traffic via the native operating system's IP/VPN stack and the per-app IP/VPN stack. An IPsec module decrypts the inbound traffic coming from the virtual data-link layer, and encrypts the outbound traffic from the per-app IP stack.

IPsec is typically implemented as a “bump in the [IP] stack”, and is integrated into an operating system's IP stack. In one embodiment, IPsec is implemented in the per-app IP stack instead. As noted, a proxy or virtual data link interface for IPsec is defined in the per-app IP stack. The IPsec encapsulates traffic (IP packets) built through the per-app IP stack and routes the traffic via the proxy data link interface, in one embodiment, to a UDP module in the native operating system space.

FIG. 7 is a block diagram showing components and modules in a mobile device needed for implementing a per-app VPN in accordance with one embodiment. Shown are the two relevant memory spaces on a mobile device 702: a sandbox area 704 containing components of a wrapped app, system components, and an IP/VPN stack, and a device operating system space 706. Also shown, external to mobile device 702, is a VPN gateway component 708. As noted above, embodiments of the present invention involve creating and implementing an IP stack for a security wrapped app in sandbox area 704.

An IP stack 710 has some of the conventional layers of a TCP/IP stack, such as network layer and transport layer. Above IP stack 710 is an HTTP proxy or sockets layer 712. Two components in app IP stack 710 are IPsec 714 and proxy data link layer 716. Virtual data link layer 716 functions as an IP interface to between virtual IPsec 714 and native UDP module 722. The combination of virtual IPsec 714, virtual data link layer 716, and IP stack 710 may also be referred to as a “per-app VPN” stack. The primary goal of the per-app VPN stack of the present invention is to replicate operations and functionality that take place in native operating system 706. Components for a single app and any system components reside in sandbox area 704. However, for illustration, only one app component 718 and one system component 720 are shown.

Native operating system 706 contains several components, including a system-level or native IP stack (not shown). One of the modules in operating system space 706 needed for the present invention is a UDP module 722. As noted, IPsec packets are transmitted from mobile device 702 to VPN gateway 708 using UDP. In other embodiments, TCP may be used. Data packets for the security wrapped app are also received from VPN gateway device 708 at UDP module 722 and relayed to proxy data link layer 716.

FIG. 8 is a flow diagram of a process of implementing an app VPN in accordance with one embodiment of the present invention. A security-wrapped app in sandbox 704 executes in a normal manner and in the process makes calls to the device operating system, some of which require communication over a VPN. This occurs at step 802. In one embodiment, those calls are re-directed to app IP stack 710 at step 804. In one embodiment, The interfaces that are exposed to the application (and app and system components, boxes 718 and 720) through HTTP proxy layer box 712 mirror the corresponding interfaces provided by operating system 706.

At step 806 the app IP stack builds IPsec packets in a conventional manner. This would normally be done by the system/native IP stack as a result of an app or system component in the sandbox making a call to the operating system. IP packets are then encapsulated using IPsec in the app VPN stack. At step 808 a proxy data link layer, also part of the app VPN stack, transmits packets to a UDP module in the system/native IP stack. In other embodiments, where an SSL VPN is implemented, the packets may be transmitted to a TCP module. As noted, the proxy (virtual) data link layer functions as an interface between the app IP stack and the system native stack, specifically the UDP module. The IPsec module can be described as a virtual IPsec interface. This interface together with the virtual data link interface functions to get IP packets down through the app VPN stack and out through the native UDP module to the VPN gateway.

It may be noted that this is possible by virtue of using network address translation (NAT). As is known in the art, this technique allows an entity to allocate “private” IP addresses which are mapped to public IP addresses. The private IP addresses are never seen by the public. Traditional NAT approaches are able to rewrite an IP packet and then send it to a different, private address.

As noted above, by configuring a particular application with its own VPN tunnel, one could configure the app security wrapping server to restrict access to network resources so that the app can only access the specific resources it needs. If HTTP proxy 712 is listening on a TCP port provided by operating system 706, other applications could potentially connect to that TCP port. In one embodiment, HTTP proxy layer 712 implements techniques to prevent other applications on the device from accessing the HTTP proxy. In one embodiment, the system determines if a connection to the HTTP proxy layer 712 came from the current process (i.e., the wrapped, host app), for example by looking through all the file descriptors in the current process and checking with the kernel to determine if any of file descriptors made the connection to HTTP proxy layer 712. If the authenticity of a group or federation of apps can be validated and establish communication between those apps, in one embodiment, the federated apps can share a single VPN tunnel. Although this would lower the number of concurrent VPN tunnels needed for a device, it may detract from some of the advantages described above, such as configuring a server to restrict access to network resources such that the app can only access the specific resources it needs.

As such, in one embodiment, only the specific security wrapped app can connect to the VPN tunnel that is created by that app's VPN stack. This prevents another malicious app or any other malware on the device from stating that the HTTP proxy for another app is also the proxy for the malicious app, thereby preventing the malicious app from using the wrapped app's VPN. In another embodiment, a VPN tunnel created by a security wrapped app may be shared by other wrapped apps in the same federation or group as the app that created the VPN.

In another embodiment, a device user may need a VPN to gain any type of access or connectivity at all (e.g., if the user is working abroad in a country with restrictive Internet access), the user can access a device-level VPN to gain initial full-internet access, and then use a per-app VPN to gain corporate access.

Another advantage of tying a single application to a VPN tunnel is the ability to restrict, at the VPN gateway, the network resources that the application has access to, by using a different “configuration profile” (i.e., VPN group) per application.

Now, in another aspect of the present invention, network security is enhanced from the perspective of the gateway owner (the gateway was first described in FIG. 7 showing a VPN between an app on a device and gateway 708). That is, security of components behind the gateway, such as a corporate or institutional network, is better protected. As described below, in one embodiment, security is enhanced by virtue of the ability to report a jail broken or rooted device. The reporting may be done via the private tunnel to the gateway which, in turn, can report to the appropriate network components. In yet another aspect of the present invention, the gateway is able to scale in order to manage a large number (e.g., tens or hundreds of thousands) of private tunnels, such as VPN or IPSEC connections, with apps on mobile devices.

As described, an app security provider, such as Mocana Corporation of San Francisco, Calif., supplies the software needed to wrap the app according to selected and default policies on the device. The app security provider can also provide software that enables the wrapped app on the device to create a private tunnel, such as a VPN or IPSEC connection, between the app and the gateway which may be under control of, for example, a company, such as the user's employer. The app security provider may also provide the software and hardware comprising the gateway to the company. In one typical scenario, the user of the device is an employee of a company. The device is in many cases owned by the user (it may also be owned by the company and given to the user for work use). The device runs one or more apps that are provided by the company to the employee/user or are in some way related to the employee's work for the employer, for example, an e-mail client. The apps are wrapped by the employer (for example, by the IT department) using the security wrapping software purchased from the app security provider. Once wrapped they are given to the user before the apps are used by the employee (herein referred to as “user”).

In one embodiment, each user has one device (or more) which may have multiple work-related, security-wrapped apps. The employer may have hundreds or thousands of users, each using their own device having one or more wrapped, work-related apps. Each of those apps, when running, will have its own dedicated private tunnel with the employer's gateway (gateway 708 above) which, in a conventional setting, means that the employer will potentially have to manage a very large number of unique IP addresses. These unique IP addresses will likely be 10 net addresses. The IP addressing issues and having to route traffic back to the apps from the gateway will almost certainly be, for the employer, an insurmountable or, at the least, a significant technical and logistical challenge.

It would be desirable to provide means and methods for reducing the number of unique IP addresses that an entity, such as an enterprise, would have to assign and manage, while concurrently being able to scale up the number of wrapped apps and their private connections with a gateway. That is, keep the advantages of a per-app VPN embodiment but reduce the administrative and technical drawbacks and impediments of implementing these embodiments.

With the increase in the number of user devices that can connect to the gateway, it would also be desirable to be able to inform the gateway of jail broken and rooted devices in order to maintain security of the internal enterprise network.

As noted, an issue that will concern enterprises using the gateway is potential breach of security of the enterprise network (behind the gateway) from jail broken or rooted devices. As noted, the mobile devices will, in most cases, belong to the employees (reflecting the increasingly common “bring your own device” BYOD environment) and the employer will want to prevent or be warned if users use jail broken or rooted devices. Or it will at least want to know when such a device is being used to connect to the gateway. For a large employer with thousands of users and, consequently, thousands of different devices connecting to its gateway, there will likely be a non-trivial number of devices that are in fact jail broken or rooted. The employer will want to know when such devices connect to the gateway.

As such, it would be desirable to enable a wrapped app to report to the gateway, via the app's private tunnel, whether it is running on a jail broken device. It is helpful to note here that the app security software running on the device that wraps the app performs the actual detection of whether the device is jail broken. This jail break detection is a separate operation from the function relevant to the present invention, namely, reporting whether the device is jail broken to the gateway (i.e., to the employer or enterprise).

Before describing two phases of establishing a connection between an app on the mobile device and the gateway, it is helpful to provide a brief overview of the capabilities of the gateway working with a wrapped app on a mobile device which, from the perspective of the gateway, may be characterized as a trust point. As described below, the functionality and capabilities of the present invention stem from the enterprise owning both the gateway and the wrapped app, both of which utilize app security software. As noted, the security provider provides software for wrapping the app and software for the gateway (and may also provide the gateway itself.

At the center of the gateway's capabilities is the ability to terminate a large number of private connections. These connections may use VPN/SSL, IKE/IPSEC or other protocols or standards. In the described embodiment and for illustrative purposes, the gateway terminates IKE/IPSEC sessions that originate from wrapped apps. Another central capability of the gateway is the ability to enforce a higher level of authentication or security between a wrapped app and the gateway. As noted, one feature of this is the ability of a wrapped app to report a jail broken or rooted device. Another capability of the gateway is getting detailed traffic data related to data packets coming from and going to wrapped apps. The gateway has other capabilities, some of which use an app-specific database.

A process of establishing a secure tunnel or connection to prevent eavesdropping or tampering between apps on the mobile device and the gateway can be described in two phases. In one embodiment, Phase 1 may be characterized as an extension of the XAUTH protocol with two-factor authentication. Each wrapped app (also referred to as “client” below) goes through IKE/Phase 1 with the gateway (also referred to as “head-end” below). When an app is wrapped, it is provided or pre-configured with a public, unique IP address and/or fully qualified domain name for the gateway. That is, the app is given the ability to “call home.” The app first needs to prove its identity to the gateway. This process starts when the app is invoked by a user on a device and attempts to make a secure, private connection with a gateway. A user can also provide a re-connect token that is unique to the user's application and device.

As is known in the art, this is the first factor of the two-factor authentication process. The client uses a secure tunnel to transmit a device certificate to the gateway using the gateway's public IP address. The gateway examines and verifies the certificate using conventional means. If the enrollment certificate is valid and the head end is satisfied as to the user's identity, the process for the second factor begins.

With respect to the second factor of the two-factor authentication process of the XAUTH extension, the client negotiates security associations for data transmitted between the client and head end (i.e., the VPN gateway). The client proceeds with the XAUTH challenge where it proves its user name, password, MAC address, and protocol. The app also reports whether the mobile device that the client is running on is jail broken and other information, such as the name of the app running on the client. This is all done via a secure IKE tunnel. The client does this using an extension to the XAUTH protocol which can be characterized as a policy enforcement and reporting protocol, referred to as PERP. The gateway can validate this data locally by checking a database or it can use an external component, such as an AAA server or Active Directory.

Once the gateway validates this data and that the PERP reply is acceptable, the second factor of the XAUTH phase is complete. By virtue of this PERP-extended XAUTH Phase 1 stage, the wrapped app is able to report jail broken devices to the gateway which, in turn, can notify components in the employer's network (e.g., a security administrator's console). In other words, the gateway uses PERP to learn specific information about the app, the user, the device, IP address, and MAC address. Therefore, for any data received by the gateway over any of the IKE/IPSEC sessions, by virtue of PERP, the employer can learn, with certainty, which user sent the data, from which wrapped app, and from which device. It should be noted here that PERP can be used between two points over any secure protocol; it is not tied to IKE as in the described embodiment. PERP can be transported through any suitable secure protocol, including VPN/SSL. The enterprise can obtain this highly granular data without having to rely on heuristics, packet analysis, or other means, which typically result in estimates or less-than-certain results (i.e., intelligent guesses as to who, what, and where). In one embodiment, this is possible because the head-end (gateway) has a relationship with a trusted client from where the data originates. Additional details on PERP and how it can be used for seamless user authentication for an existing wrapped app, are provided below.

Once security associations are negotiated, the client initiates configuration with the gateway. In one embodiment, this is done using IKE MODE-CONFIG. At this stage, the app requests the gateway for a unique (internal) IP address. The app may also request a subnet and DNS name.

With respect to XAUTH/PERP in one embodiment, the gateway may query the client/app whether it is in a federation of wrapped apps on the device. That is, is the app one in a group of apps, referred to as a federation of apps, from the enterprise? If the client responds that it is, the gateway responds by transmitting a federation cookie to the client. In another embodiment, the gateway does not query the wrapped app regarding federation. Instead, it simply sends the app a federation cookie. If the app is not part of a federation, that is, it is the only app from the enterprise on the device (i.e., is a federation of one), the cookie subsequently may or may not be used. As described in the steps below, other apps in the federation will use this federation cookie.

Once the wrapped app has configured itself with the gateway, IKE Phase 1 is complete and a secure tunnel between the app and the gateway has been established. This policy enforcement would most likely be implemented during Phase 1 using PERP and later by monitoring traffic within the IPSec session. The gateway is now able to enforce policy (e.g., geo-fencing, copy/paste, etc.) on behalf of the employer/enterprise on users' devices. For example, the gateway can enforce a policy for a specific employee who is using a particular wrapped app and dictate that the employee only uses the app during certain times of the day and only in a certain geographic area (e.g., while on the employee's premises). The gateway can do this because it has accurate and specific data relating to the user, app, and device and can apply the one or more policies accordingly. As described below, the gateway may have a database containing app and other data that are used for policy enforcement.

The gateway is pre-configured to a specific mode, for example, IKE main mode or aggressive mode, XAUTH enabled or not enabled, and so on. As noted above, a wrapped app starts up and given that it is pre-configured with a public and unique IP address and domain name of the gateway, it is able to connect with the gateway. In main mode session, there is a six-packet exchange at the end of which the gateway and the wrapped app essentially trust each other. This is the end of 1-factor authentication after which the XAUTH/PERP exchange takes place, as described below.

During Phase 2, new security associations are negotiated via the Phase 1 tunnel and two new unidirectional Phase 2 tunnels are created between the app and the gateway; one for transmit and one for receive. As is known in the field of IPSEC, the gateway and the wrapped app create symmetric keys used by each for transmitting data through a new tunnel. At this point, Phase 2 is used for transmission of actual app data between the app and the gateway. Up to the end of Phase 2, the described embodiment utilizes: 1) the protocol for policy enforcement referred to herein as PERP, and 2) sending a federation cookie to the app for reasons described below.

As noted above, in IPSEC Tunnel Mode, the gateway gives a unique and private IP address to the wrapped app on the device. Often, a device will have more than one wrapped app and because each app goes through configuration mode, each will get a unique and private IP address. Essentially, each wrapped app registers with the gateway (one app begins by sending a message: “I'm app 1 on device A. I need a private IP address; another app on the device sends a message: I'm app 2 on device A, I need a private IP address . . . ,” and so on.). There may be three, four or more such wrapped apps, each one having to go through Phase 1 with the gateway because each is an independently operating app even though they are in the same federation. Although they may be in the same federation, effectively, they do not know about each other, other than that they are in the same federation.

As mentioned, it would be a challenging technical and logistical task for an enterprise or any entity to manage a large volume of external, unique IP addresses (e.g., “10-net” addresses). Generally, it would not be feasible to issue a unique IP address for each wrapped app on each device. At the same time, it is desirable from a security standpoint to implement a private VPN or tunnel between each wrapped app and the gateway, as described with respect to FIGS. 7 and 8. The gateway hardware and software can scale to terminate a large number of IKE/IPSEC sessions. In another embodiment, it can terminate a large volume of SSL/VPN connections. Therefore, there is a need to consolidate IP addresses.

In one embodiment, this consolidation involves bundling or consolidating IP addresses for apps in one federation on one device. For example, if there are eight wrapped apps in one federation on a user's mobile device, instead of the VPN gateway having to assign eight unique IP addresses for that device such that each client (app) has a private VPN to the gateway, the gateway can assign one unique IP address for all the wrapped apps in one federation on the device. In one embodiment, the wrapped apps are in one federation. An employee of an enterprise will likely keep all the enterprise apps that the employee uses for work, all of which will be wrapped, in one federation.

As a result of this consolidation of IP address, in the described embodiment, the gateway has methods of determining which federated wrapped app on the device should receive return traffic, wherein that traffic could be for any one of the wrapped federated apps, when that traffic is received by the device. That is, which app should be designated to receive all incoming traffic for any app in that federation and route the traffic to the appropriate federated app once in the federation. The single IP address is sufficient to determine the mobile device and app federation but not the specific mobile app since each mobile application in the federation has its own set of Phase 2 security associations with the gateway; it is the port range, used in conjunction with the IP address, which allows the gateway to select the correct out-going security association for a particular mobile app in a federation. The IP address is for a specific federation of apps and thus the traffic will at least reach the correct federation. It is important to note here that the private connection and the traffic between the wrapped apps and the gateway still utilize correct Phase 2 security associations; this is accomplished with IP address together with the port range. Thus, the data traffic is still encrypted using the symmetrical keys established between a particular wrapped app and the gateway as described above.

For purposes of illustrating methods of consolidating IP addresses and thereby enabling termination of a large number of VPN tunnels at the gateway, it is assumed that there is a federation of n wrapped apps on a device, all of them related to the user's employer (or other related entity/enterprise), so that the app federation is tied to a gateway under control of the employer. There may only be one wrapped, work-related app on a device in which case the methods described here may not be needed. In this case all that is needed is the unique IP address that the app would have anyway. However, in another embodiment, the single app may be treated as an app federation of one and still go through the processes described below for consistency, given that there is no significant drawback to doing so. When a user invokes an app in the federation, the app connects with the gateway using the methods described above with respect to the per-app VPN embodiments. The app identifies itself to the gateway and goes through Phase 1. The gateway assigns a unique IP address to the app and, in one embodiment, will query whether the app is part of a federation. The app will respond that it is (in this illustration) and the gateway then transmits a federation cookie. The gateway instructs the app to share the federation cookie with the other apps.

In one embodiment, when the app is wrapped, it is configured so that if it receives a federation cookie from the gateway (the app will know how to recognize one when it sees it), the app knows that it should make the cookie available to other apps in its federation. Other apps in the federation which attempt to register with the gateway will be challenged for this federation cookie. If another, subsequently invoked, app has the cookie, the gateway knows it can proceed with manipulating IP addresses and other data in a particular manner described below.

When an app is wrapped, it is configured to be aware that it may be given a federation cookie or that it may have to get a federation cookie (from another app in the same federation) because the app will be challenged for it by the gateway. Essentially, the federated app can be configured during wrap time to be aware of the possibility that it may have to utilize a federation cookie when communicating with the gateway.

In one embodiment, when the gateway gives the federated app that is invoked first on the mobile device a federation cookie, the gateway also gives the app a unique IP address, that is, an IP address that is unique to that mobile device. In addition to these two items—the cookie and the IP address—in one embodiment, it also gives the app a range of ports on the mobile device. This port range is used for communicating data traffic from the source IP (the mobile device) to the destination IP (the gateway) and vice versa.

The app uses this port range to restrict which source port (on the mobile device) to use when sending data from a particular federated app on the device. To illustrate, the gateway may send the app a port range of 100-199. This tells the app that it must use a port in the range of 100 to 199 when it generates and sends traffic over the private VPN connection. In this manner, when the gateway receives traffic from the device and sees that the source port (port on the mobile device) from where the traffic is originating is within the 100-199 range, the gateway knows that the traffic is coming from a specific federated app, even though the IP address is unique only with respect to the device. On the device, the port number that an app will use is assigned to the app by the operating system of the device and that port number is unique to that app.

On the enterprise side of the gateway, where the gateway receives traffic from enterprise servers (e.g., employer's internal network), the gateway can use this port range as a test for the destination port range (on the gateway) which maps to the app on the device. The primary restrictions are that the source port number (on the mobile device) be in the range assigned by the gateway and that no other app on the device use that port number when communicating over the VPN. As noted above, a federation of one app may still be assigned a port range even though it would be sufficient to use the unique IP address.

At this stage a first app in the federation is running and has a secure VPN with the gateway, has a unique IP address, and is transmitting data from a port within a port range as instructed by the gateway. Now a second app in the federation is open. This second app is challenged by the gateway for the federation cookie. Once the cookie is presented to the gateway by the second app, the app is given the same IP address that was given to the first app. The gateway does not have to generate a second unique IP address solely for that second federated app. However, the gateway gives the second app a different port range, e.g., 200-299. When the second app begins sending data through its own private VPN tunnel from a port in the 200-299 port range, the gateway will know that data it receives from any source port in the 200-299 range is from the second app in a specific federation. As noted, the gateway also instructs the second app to ensure that the port the second app uses is unique to that app so the gateway can be confident that traffic it receives that originates from a specific source port is coming from that specific app (i.e., the second app). Only that app will use a port in that range. When the app is closed and is done using the port, the port is closed and the operating system can recycle that port number. The gateway will no longer expect that only that app will use a source port in the 200-299 port range.

FIG. 9 is a block diagram showing components of a network illustrating various concepts of the present invention. Shown are a mobile device 902, a gateway 904, and an enterprise (private) server 906. There are two secure tunnels or VPNs 908 and 910 between two apps 912 and 914, respectively, between device 902 and gateway 904.

Mobile device 902 has a public IP address (assigned to it by a public carrier), referred to as the source IP (“SRC IP”). Each of the apps 912 and 914 have an internal IP address and a port range both of which are assigned by gateway 904. Gateway 904 has a public IP address (“DEST IP”) assigned by a public carrier, an internal IP address pool, and a port range data store.

There are two VPNs or secure tunnels 908 and 910. VPN 908 connects app 912 through port x 916 on mobile device 902 (port x exclusively for app 912) to gateway 904. Secure tunnel 908 is established or created using SRC IP address and DEST IP address. Similarly, secure tunnel 910 is created using SRC IP and DEST IP addresses and connects app 914 via port m 918 to gateway 904. However, the data that travels in the secure tunnels have addresses consisting of internal IP addresses and an enterprise server IP address which may be characterized as “inside the tunnel” IP addresses. As explained below, data traveling in the secure tunnels are typically between a private enterprise server, such as server 906, and a specific app. Therefore, data transmitted via gateway 904 through a secure tunnel uses the internal IP address of the app and not the public IP address of device 902.

FIG. 10 is a flow diagram of a process executed by the gateway device to determine how to route data back to a specific app from an enterprise server in accordance with one embodiment. At step 1002 the device and gateway are assigned (or have already been assigned) public IP addresses from the enterprise's Internet service provider, such as AT&T or Comcast. The one for the mobile device is the source IP address (SRC IP) and the one for the gateway is the destination IP address (DEST IP).

At step 1004 a user opens a first app on the mobile device. The gateway assigns an internal IP address from its own IP address pool to the first app. It also assigns a port range to the app (referred to herein as “first port range”). The gateway stores these data, specifically, the internal IP address and the port range assigned to the first app in a data store. At step 1006 the user opens a second app on the mobile device. The gateway assigns the same internal IP address to the second app, thereby not having to utilize another internal IP address from the gateway's limited pool of internal addresses. The first and second apps have the same gateway internal IP address. The gateway also assigns the second app a second port range which is different from the port range assigned to the first app. These data are also stored in the gateway. As described below, the data is used by the gateway to perform essentially a reverse look-up to see which app (out of all the apps which were given the same internal IP address by the gateway) should receive the data.

The user executes the first app and the app begins transmitting data to the gateway in route to an enterprise server (related to the app) operating behind the gateway in the private enterprise network at step 1008. At step 1010 the user performs the same with the second app where data is transmitted to an enterprise server (it may be the same one as the first app or another one) behind the gateway. In this example, the two apps send data to the same enterprise server.

At step 1012 the enterprise server sends data back to the gateway. This data may be in response to a request from the first app or from the second app. The private enterprise server itself does not know which app on which device the data should be sent to. It performs no look-ups or any type of address translation. It simply transmits the data back to the gateway, where the data is accompanied by an internal IP address. The data from the enterprise server to the gateway has an internal DEST IP address and a DEST PORT embedded in it (or that of the mobile app which originated the traffic).

At step 1014 the gateway receives the data from the server and examines the internal IP address and destination port of where the data is to be sent. The gateway searches in an internal or otherwise accessible data store to determine that the internal IP address it got with the data from the enterprise server corresponds to a specific federation of apps on the device and uses the destination port to look up which specific app in the federation on the device is supposed to receive the data. In one embodiment the gateway needs both these items—the internal IP address and the port number—to determine specifically which app should receive the data and which security association to use to send the data. At step 1016 the gateway sends data to the specific app through a dedicated secure tunnel between the gateway and the specific app using the app's assigned port on the mobile device.

FIG. 11 is a screen shot of a display showing a VPN gateway active session monitor screen in accordance with one embodiment. A user display 1102 shows various data and information related to users, devices, and apps connected to the gateway. A typical user of an active session query as shown in FIG. 11 may be an enterprise mobile security officer or network security administrator, both interested in learning specific and accurate information about mobile app and data usage in their enterprises.

In screen 1102 are three categories of data: session data 1104 for a selected user; device data 1106, and app data 1108. This information is for a selected user shown in box 1110. Session data 1104 shows information such as username, group, login time and date, length of current session, and the authentication provider. In this case user authentication is provided by using a reconnect token.

Another category of active session data is “Devices Used” shown in box 1106. Shown here is a device identifier (e.g., in some cases a MAC address), the hardware type of the device (which can be a smartphone, tablet, laptop, or other portable computing device), the type of operating system on the device and the version, and the name of the carrier providing the connection, if applicable. The carrier may provide the public IP addresses of the mobile device and the gateway.

The third category is data 1108 describing applications being used by the selected user. This data includes device information (same as the device data shown in box 1106), the name of the application (in this case, a browser app and another app called “NSLookup”) and related app information. This may include a package ID, UUID, and a Session ID. Recall that the apps that are monitored using the VPN gateway are security wrapped using the processes described above in FIGS. 1-6. As such, also provided is app security software version information used to wrap the app listed in box 1108. All this information provides in-depth detail that is based on actual app usage in the enterprise: who is using which apps (and what version of those apps) on which devices (and what is the state of those devices) and for how long.

Another aspect of the present invention addresses reducing complexity in the gateway, specifically in the Internet Key Exchange (“IKE”) process of the gateway. In this aspect of the invention, the number of datapath processes that perform IPSec cryptographic operations in the gateway can be scaled (significantly increased) without adding complexity to the IKE process itself. Datapath processes are the paths between a secure app on a mobile device and the gateway component (described above). It is where data packets are encrypted or decrypted depending on which direction they are being transmitted. Datapath processes primary function is to handle traffic that goes to and from the VPN tunnels between secure apps and the gateway.

In one embodiment, the VPN gateway uses a user space stack, referred to LWIP, for IP/UDP handling, forwarding look-ups, and virtual network interfaces. With the present invention, there is linear scaling up to the number of datapath processes in the gateway without placing any additional computational burden or complexity on the IKE process. The number of datapath processes is controlled by software system configuration. In the present invention, there can be any combination of datapaths per IKE process. In the future there may be multiple IKE processes.

IKE Phase 1 negotiation takes place between an app on the device (e.g., mobile phone, tablet, etc.) and the enterprise VPN gateway and results in a secure connection or tunnel between the two end points. Security Associations (“SA”) for IKE Phase 2 are negotiated using the secure tunnel created in IKE Phase 1. Tunnels are defined as virtual interfaces (IP layer) where data is either encrypted or decrypted depending on which direction it is flowing. The security associations allow a secured app to communicate securely with an intended end point, such as a Web server, intranet, app server, or other internal enterprise server, such as those described above, behind the gateway. These are the communications that are transmitted via the VPN established after Phase 2 is complete.

One component of the present invention is a security policy database (SPD) which stores policies. Another component is the security parameter index (SPI) which, as described, point to security associations. Typically, two SPIs are negotiated between the secure app and the gateway during IKE Phase 2. The SAs are generated by the secure app and the gateway for data that the app wants to encrypt or decrypt. In one embodiment, an SPI is a unique identifier that essentially points to an SA. That is, data going from the gateway to the app (on the mobile device) has one SPI and data going from the app to the gateway has another SPI. Each SPI references an entry in an SPD/SA cache. As is known in the art, an SA includes encryption keys, types, algorithms, and the like. In one embodiment, the IKE process populates the SPD/SA cache with SPIs and SA; it does not distribute SPIs, SAs, or any other data directly to the datapath processes. The SPD/SA cache dictates which packets are allowed to go through the gateway and onto the internal enterprise servers. The packets are either decrypted or encrypted using the policy (SA) specified in the SPD/SA database.

In one embodiment, the gateway receives a unique data flow of packets and a packet header is queried for an SPI identifier. This SPI value is used, first, to search the IPSec local cache for a policy. If the policy entry in the local cache specifies a policy for decryption or encryption, these policy instructions are used to encrypt or decrypt (i.e., how such crypto operations should be performed). As noted, this information is negotiated and obtained during IKE Phase 2 negotiation with the secured app on the device. The gateway also uses the local cache entry to see if the gateway should let a packet into the tunnel (encrypting the packet) or out of the tunnel (decrypting the packet) based on instructions in the policy/SA.

If the SA is not in the IPSec local cache (there's a miss), the gateway searches the SPD/SA cache to determine which the SA to utilize within a datapath instance. Security associations typically include key, cipher, lifetime, hash, and other data. Once the gateway knows the SA, the gateway can retrieve them from the SPD/SA cache (i.e., get them “off the shelf”) and use them on the datapath instance. As noted, after IKE Phase 2 is negotiated, IKE installs the SPD/SA entry in the cache and populates the entry with SA information. The SPI values refer or point to previously negotiated security associations. The SA entry is pointed to by the SPI entry which is used by IPSec in performing the local (first) look up.

In the described embodiment, datapaths are Linux processes. Within each datapath there is an IPSec sub-component library. Packets transmitted between a secure app on a mobile device and datapaths in the gateway are either encrypted or decrypted and authenticated in both directions.

As described above, the present invention involves a two cache configuration in the gateway, the SPD/SA cache (aka, IKE centralized cache) and the IPSec local cache. The IKE process maintains a centralized cache and each datapath (IPSec instance) maintains a local cache storing a small subset of information in the IKE centralized cache. Within each cache are an SPD component and an SA component. For each packet, the datapath first does a look-up in the SA component (as pointed to by the SPI extracted from the packet) of the corresponding IPSec local cache. If there is a hit in the local cache, then the datapath can retrieve SA information from there, that is, instructions on what crypto operations to perform on the packet, what algorithms to use, and the like. There is no need for external intervention, specifically, from the SPD/SA cache (the IKE centralized cache).

If there is a miss for an SA in the local cache for the IPSec packet, then the gateway searches the SPD/SA cache which may be described as the high-level authority of what has been negotiated by the IKE process. If an SA is found in the IKE centralized cache, the SA is then loaded into the datapath local IPSec cache so that subsequent IPSec packets having the same SPI can find it locally. If the SA is not in the centralized cache, the packet is dropped. This two-step cache look-up process for obtaining security associations and performing crypto operations greatly reduces the complexity and processing load of the IKE process. This enables a greater number of datapath processes for each IKE process; that is, scaling the number of datapaths managed by the IKE process by removing complexity inherent in datapath processes.

As described above, the SPI, extracted by the gateway from the packet header, is used to do the look-up in the SA component of the local cache and, if not found there, in the SPD/SA cache. For non-IPSec return path traffic, the datapath hashes the packet header to determine the SPI. This key is used to search the SA components of the local cache and the SPD/SA cache. The gateway can determine whether the packet is allowed to be transmitted via the tunnel or not. In one embodiment, as noted, if there is a miss in the local IPSec cache and a hit in the SPD/SA cache, the gateway stores the SA in the local IPSec cache.

In one embodiment, IKE does not inquire about replacing dropped datapaths. If there is a failure in a datapath (i.e., no SA), the SA flow transitions to another datapath. This enables high availability of datapaths. Instead of IKE inquiring about dropped datapaths and transitioning, the IPSec instance in the new idle datapath makes the inquiry. Note that IKE still negotiates with the wrapped app and supplies SPD and SA data to the SPD/SA cache. Datapaths then query this IKE central cache and load their local caches based on hits in the central cache.

FIG. 12 provides an overview of some of the relevant components of the present invention as already described above. An IKE process 1202 has two APIs relevant to the described embodiment, an IPSec Client API 1204 and a Datapath Client API 1206. IPSec Client API 1204 enables IKE 1202 containing a centralized cache 1210. API 1204 and centralized cache 1210 are both able to communicate with SPD/SA cache 1210. As described above, IKE process 1202 populates cache 1210 after negotiating SA with a wrapped app. Cache 1214 is updated by IPSec 1212 querying centralized cache 1210 when needed, specifically, when there is a miss in local cache 1214 during the first look-up and a hit in SPD/SA cache 1210 in the second look-up.

FIG. 13 is a flow diagram of a process of obtaining security association data for a packet without involving the IKE process in accordance with one embodiment. At step 1302 the gateway receives a unique data flow from a wrapped app on a computing device, such as a phone or tablet, via a dedicated VPN between the app and the gateway. At step 1306 the gateway determines whether the data is comprised of IPSec packets. If they are, control goes to step 1308 where the gateway extracts an SPI from the packet. In one embodiment, it extracts this from the packet header at step 1318. At step 1310 a look-up is performed using the SPI for policy in the local IPSec cache. If this results in a hit and the SA is found there, control goes to step 1312 where the gateway performs any necessary crypto operations on the data packets using the SA found in the local cache.

Returning to step 1306, the gateway determines whether the data flow is comprised of IPSec packets. If it is, control goes to step 1308 and the process progresses. If the data packet is not an IPSec packet, control goes to step 1318 where the packet header is hashed into a SPI value. Logic proceeds to step 1310 where a lookup is performed in the local IPSec cache and commences as described above.

If the local cache does not contain the SA, a miss, control goes to step 1314 where the datapath does a search using the SPI in the SPD/SA cache, also referred to as the IKE centralized cache, the high-level authority on what SA's have been negotiated by IKE in Phase 1. If the SPD/SA cache has an SA entry for the packet, the required crypto operations are performed on the packet using the identified SA as shown in step 1312. In addition, the datapath updates or populates the local cache with the SA entry so that when subsequent packets come in with the same SPI, the gateway can find the SA in the local cache. If the SA is not located, the packet is dropped at step 216 and the process is complete.

The packets are either encrypted or decrypted using the security associations using a two-level cache configuration outside of the IKE process. As such, the complexity of the crypto operations for a datapath is removed from IKE thereby enabling the IKE process to scale, in other words, handle a significantly larger volume of datapaths, i.e., VPN connections from wrapped apps. In addition, the process is efficient given that the gateway may in many cases find the SA for a data packet locally in the IPSec local cache and need not look in the IKE centralized cache, let alone require any resources of the IKE process.

FIG. 14 is a block diagram showing data and interactions between the SPD/SA cache and the IPSec local cache. Shown are an IKE process 1402 and an IPSec Client API 1404. Shown are four entries 1406 in the SA portion 1414 of the local cache in IPSec 1410. These entries include inbound and outbound SPI keys and two IKE SA keys. Also shown are four entries 1408 in the SPD portion 1412 of the local cache in IPSec 1410. Block 1408 includes traffic rules and SPD keys which is used in the non-IPSec packet flow where an SPI must be determined based on hashing the non-IPSec packet header. Blocks 1412 and 1414 are stored in the Datapath process within its IPSec component local cache.

FIG. 15 is a block diagram showing in more detail what data is populated in the SPD/SA cache in accordance with one embodiment. An inbound SPI and SA 1502 and an outbound SPI and SA 1504 point to an IKE SA 1506. Within 1506 are the two security associations negotiated between the gateway IKE process and the application noted as “inbound/outbound-blob.” The “blob” contains keys and cryptographic agreements between the IKE process and the secured app. In addition to keys and cryptographic agreements, traffic selectors are also negotiated between the gate IKE process and the application; these traffic selectors are stored as Traffic Rules within the SPD and are accessed via 1508 and 1510.

In one embodiment, the LWIP “netif” opens up a raw socket with a special Berkley Packet Filter (BPF) for sending and receiving data. Each datapath has a different BPF and uses it to implement load balancing.

FIGS. 16A and 16B illustrate a computing system 1600 suitable for implementing embodiments of the present invention. FIG. 16A shows one possible physical form of the computing system. Of course, the computing system may have many physical forms including an integrated circuit, a printed circuit board, a small handheld device (such as a mobile telephone, handset or PDA), a personal computer or a super computer. Computing system 1600 includes a monitor 1602, a display 1604, a housing 1606, a disk drive 1608, a keyboard 1610 and a mouse 1612. Disk 1614 is a computer-readable medium used to transfer data to and from computer system 1600.

FIG. 16B is an example of a block diagram for computing system 1600. Attached to system bus 1620 are a wide variety of subsystems. Processor(s) 1622 (also referred to as central processing units, or CPUs) are coupled to storage devices including memory 1624. Memory 1624 includes random access memory (RAM) and read-only memory (ROM). As is well known in the art, ROM acts to transfer data and instructions uni-directionally to the CPU and RAM is used typically to transfer data and instructions in a bi-directional manner. Both of these types of memories may include any suitable of the computer-readable media described below. A fixed disk 1626 is also coupled bi-directionally to CPU 1622; it provides additional data storage capacity and may also include any of the computer-readable media described below. Fixed disk 1626 may be used to store programs, data and the like and is typically a secondary storage medium (such as a hard disk) that is slower than primary storage. It will be appreciated that the information retained within fixed disk 1626, may, in appropriate cases, be incorporated in standard fashion as virtual memory in memory 1624. Removable disk 1614 may take the form of any of the computer-readable media described below.

CPU 1622 is also coupled to a variety of input/output devices such as display 1604, keyboard 1610, mouse 1612 and speakers 1630. In general, an input/output device may be any of: video displays, track balls, mice, keyboards, microphones, touch-sensitive displays, transducer card readers, magnetic or paper tape readers, tablets, styluses, voice or handwriting recognizers, biometrics readers, or other computers. CPU 1622 optionally may be coupled to another computer or telecommunications network using network interface 1640. With such a network interface, it is contemplated that the CPU might receive information from the network, or might output information to the network in the course of performing the above-described method steps. Furthermore, method embodiments of the present invention may execute solely upon CPU 1622 or may execute over a network such as the Internet in conjunction with a remote CPU that shares a portion of the processing.

Although illustrative embodiments and applications of this invention are shown and described herein, many variations and modifications are possible which remain within the concept, scope, and spirit of the invention, and these variations would become clear to those of ordinary skill in the art after perusal of this application. Accordingly, the embodiments described are to be considered as illustrative and not restrictive, and the invention is not to be limited to the details given herein, but may be modified within the scope and equivalents of the appended claims. 

What I claim is:
 1. A method of communicating through a virtual private network (VPN) tunnel between a first application (app) on a device and a Virtual Private Network (VPN) gateway having an Internet Key Exchange (IKE) which performs cryptographic operations on data packets comprising: transmitting from the gateway a first range of ports to the first app, wherein the first app uses a port in the first port range as a source port for data transmission from the first app to the VPN gateway, wherein the first port range comprises a plurality of ports not included in a second port range transmitted to a second app having a same internally unique IP address as the first app; receiving, at the VPN gateway, data packets from the first app; and determining, at the VPN gateway, that the data transmission originated from the first app based on the source port; searching a local cache in an IPSec component of a datapath for security associations for the data packet; searching a security policy and security association cache for security associations for the data packet; and performing cryptographic operations on the data packet using the security associations outside of the IKE process in the gateway.
 2. A method as recited in claim 1 further comprising: extracting the security parameter index from a header of the data packet.
 3. A method as recited in claim 1 further comprising: populating the security policy and security association cache in IKE with security association data negotiated by IKE during an IKE Phase 2 stage.
 4. A method as recited in claim 1 further comprising: updating the local cache with security association data when there is a miss in the local cache and a hit in the security policy and security association cache for security association data.
 5. A method as recited in claim 1 further comprising: determining whether a data packet is an IPSec data packet or a non-IPSec data packet.
 6. A method as recited in claim 5 further comprising: hashing the packet header to obtain a security parameter index if the packet is a non-IPSec data packet.
 7. A method as recited in claim l further comprising routing the data packet to a datapath in the gateway.
 8. A method as recited in claim 1 wherein said searching of a security policy and security association cache is performed if there is a miss in searching of the local cache in the IPSec component of the datapath.
 9. A method as recited in claim 1 wherein a subsequent data packet received by the gateway having a second security parameter index that is the same as the security parameter index of the first received data packet obtains security association data from the local cache and that is the same as security association data for the first received data packet. 